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Abstract 

In this paper, we discuss a voting model with two candidates, Ci 
and C2. We set two types of voters-herders and independents. The 
voting of independent voters is based on their fundamental values; on 
the other hand, the voting of herders is based on the number of votes. 
Herders always select the majority of the previous r votes, which is 
visible to them. We call them digital herders. We can accurately 
calculate the distribution of votes for special cases. When r > 3, we 
find that a phase transition occurs at the upper limit of t, where t 
is the discrete time (or number of votes). As the fraction of herders 
increases, the model features a phase transition beyond which a state 
where most voters make the correct choice coexists with one where 
most of them are wrong. On the other hand, when r < 3, there is no 
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phase transition. In this case, the herders' performance is the same as 
that of the independent voters. Finally, we recognize the behavior of 
human beings by conducting simple experiments. 
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1 Introduction 



In general collective herding poses interesting problems in several fields. To 
cite a few examples in statistical physics, anomalous fluctuations in financial 
markets [1] [2] and opinion dynamics |3] have been related to percolation and 
the random field Ising model. To estimate public perception, people observe 
the actions of other individuals; then, they make a choice similar to that 
of others. Recently, these behaviors have been referred to as Kuki-wo-yomu 
(follow an atmosphere) in Japanese. Because it is usually sensible to do what 
other people are doing, the phenomenon is assumed to be the result of a 
rational choice. Nevertheless, this approach can sometimes lead to arbitrary 
or even erroneous decisions. This phenomenon is known as an information 
cascade 

A recent agent-based model proposed by Curty and Marsili [5] focused 
on the limitations imposed by herding on the efficiency of information ag- 
gregation. Specifically, it was shown that when the fraction of herders in a 
population of agents increases, the probability that herding yields the correct 
forecast (i.e., individual information bits are correctly aggregated) undergoes 
a transition to a state in which either all herders forecast rightly or no herder 
does. 

In the previous paper, we introduced a voting model that is similar to 
a Keynesian beauty contest [0] [Z] [B] • There are two types of voters-herders 
and independents-and two candidates. Herders are known as copycat voters; 
they vote for each candidate with the probabilities that are proportional to 
the candidates' votes. In the previous paper, they were known as analog 
herders. We investigated a case wherein all the voters are herders |S] . In such 
a case, the process is a Polya process, and the voting rate converges to a beta 
distribution in a large time limit |10j. Next, we doped independent voters 
in herders. The proposed voting model is a binomial distribution doped in 
a beta binomial distribution mathematically. In the upper limit of t, the 
independent voters make the distribution of votes converge to Dirac measure 
against herders. This model consists of three phases. If herders constitute 
the majority or even half of the total voters, the voting rate converges more 
slowly than it would in a binomial distribution. If independents constitute 
the majority of the voters, the voting rate converges at the same rate as it 
would in a binomial distribution. The phases differ in terms of the velocity 
of the convergence. If the independent voters vote for the correct candidate 
rather than for the wrong candidate, the model consists of no case wherein 



3 



the majority of the voters select the wrong answer. The herders affect only 
the speed of the convergence; they do not affect the voting rates for the 
correct candidate. 

The model introduced by Curty and Marsili has a limitation similar to 
that of our previous model in the case wherein voters are unable to see 
the votes of all the voters; they can only see the votes of previous voters. 
However, there is a significant difference between our model and their model 
with respect to the behavior of the herders. In their model, the herders 
always select the majority of the votes, which is visible to them. Thus, their 
behavior becomes digital (discontinuous). Digital herders have a stronger 
herding power than analog herders. 

Here, we discuss a voting model with two candidates, Cq and Ci. We 
set two types of voters-independent and herders. In this paper, the herders 
are digital herders, as in the case of the model introduced by Curty and 
Marsili. The voting of independent voters is based on their fundamental 
values. On the other hand, the voting of herders is based on the number of 
votes. Herders always select the majority of the previous r votes, which is 
visible to them. 

The remainder of this paper is organized as follows. In section 2, we 
introduce our voting model, and we mathematically define the two types of 
voters-independents and herders. The voters can see the previous r votes 
of the voters. In section 3, we calculate the exact distribution functions of 
the votes for the case wherein the voters can see the votes of all the vot- 
ers. We discuss the phase transition using the exact solutions. In section 
4, we discuss the special case, r — 1. In this case, we calculate the exact 
distribution function; however, there is no phase transition. In section 5, we 
analyze the model using mean field approximation. We can show that the 
phase transition in this system occurs when r > 3. In section 6, we describe 
numerical simulations performed to confirm the analytical results pertaining 
to the asymptotic behavior. In section 7, we conduct simple social experi- 
ments to recognize the behavior of human beings. Finally, the conclusions 
are presented in section 8. 



2 Model 

We model the voting of two candidates, Cq and Ci; at time t, they have 
Co(t) and Ci{t) votes, respectively. At each time step, one voter votes for 
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one candidate; the voting is sequential. Voters are allowed to see r previous 
votes for each candidate when they vote so that they are aware of public 
perception. If r > t, voters can see t previous votes for each candidate. At 
time t, the number of votes for Cq and Ci are Co(t) and c[{t), respectively. In 
the limit r — > oo, voters can see all previous votes. Therefore, c^{t) = co(t) 
and cf'it) = ci{t). 

There are two types of voters-independents and herders; we assume an 
infinite number of voters. Independent voters vote for Co and Ci with proba- 
bilities l — q and q, respectively. Their votes are independent of others' votes, 
i.e., their votes are based on their fundamental values. Here, we set Cq as the 
wrong candidate and Ci as the correct candidate to validate the performance 
of the herders. We can set g > 0.5 because we believe that independent vot- 
ers vote for the correct candidate Ci rather than for the wrong candidate Cq. 
In other words, we assume that the intelligence of the independent voters is 
virtually correct. 

On the other hand, herders vote for a majority candidate; if CQ(t) > 
c^(t), herders vote for the candidate Cq. If CQ{t) < c^(t), herders vote for 
the candidate Ci. If c^it) = c^(t), herders vote for Cq and Ci with the 
same probability, i.e. ,1/2. In the previous paper, the herders voted for each 
candidate with probabilities that were proportional to the candidates' votes 
[7]; they were known as analog herders. On the other hand, the herders in 
this paper are known as digital herders (Fig. [T]). 

The independent voters and herders appear randomly and vote. We set 
the ratio of independent voters to herders as (1 — p)/p. In this paper we 
mainly pay attention in large t limit. It means the voting of infinite voters. 

3 Exact solutions for r = oo 

In this section, we study the exact solution of the case r = oo by using 
combinatorics. 

Here, we map the model to correlated Brownian motion along the edges 
of a grid with square cells, and we count the number of paths. Let m and n 
be the horizontal axis and the vertical axis, respectively. The coordinates of 
the lower left corner are (0, 0); this is the starting point, m is the number of 
voters who vote for Ci, and n is the number of voters who vote for Cq. A 
path shows the history of the votes. If a voter votes for Ci, the path move 
rightwards. If a voter votes for Cq, the path move upwards. 
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Figure 1: Demonstration of model. R = Cg/{cg + €[}. 

We define Pi{m, n) as the probability that the {n + m + l)th voter votes 
for the candidate Cj, where i = 0, 1. The probability of moving upwards is 
as follows. 



The probability of moving rightwards is Pi{m,n) = 1 — Po{m,n) for each 
case. Here, we introduce X(m, n) as the probability that the path passes 
through the point (m,n). The master equation is 

X{m, n) = Pi{m — l,n)X{m — l,n) + -Po(^) — l)X(m, n — 1), (2) 

for m > and n > 0, with the initial condition X(0,0) = 1. This defines 
X{m, n) uniquely. Hereafter, we refer to the region m < n as /, m > n as 
J/, and m = n as III (Fig. [2]). 

First, we consider the case g = 1. At this limit, independent voters always 
vote for only one candidate, Ci (if we set g = 0, independent voters vote only 
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Figure 2: Voting and Path. A path shows the history of the votes. If a 
voter votes for Ci, the path move rightwards. If a voter votes for Cq, the 
path move upwards. The sample arrowed line shows the voting of 6 voters, 
0, 0, 1, 1, 1, 1. We refer to the regions m < n, m > n, and m = n as /, //, 
and III respectively. 



for Co). The probability is reduced from ([T]) to 
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(3) 



In this case, if the path enters // (m > n), it can only move rightwards. 
Hence, n = m — 1 becomes the absorption wall, where m,n > 0. There is a 
difference between the probability (|3]) in / (m < n) and that in /// [m = n). 
Then, we have to count the number of times the path touches the diagonal. 
Using f pOjl . we can calculate the distribution for m < n. (See Appendix 
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where Am^n,k is given by ( 15U]) and k is the number of times the path touches 
the diagonal. 

The distribution for m > n can be easily calculated for the absorption 
wall n = m — 1, where m,n > 0. The distribution for m > n is given by 

X{m, n) = }^ An,n,k — m>n. (5) 

k=o ^ 

We investigate the limit t — )■ oo. Here, we consider m as a variable; it 
is the distribution function of the vote for C\. For large t, we can assume 
that only the first terms of the summation of (jlj) and ^ are non-negligible. 
The first term becomes the difference of the binomial distributions using 
(!30|) . For mjt < 1/2, the peak of the binomial distribution is 1 — p and for 
m/t > 1/2, it is 1. Then, we can obtain the distribution in the scaling limit 
t = m + n — 7- oo, 

J^Z. (6) 

The probability measure of Z is 

/i = a5i_p + (35i, (7) 

where is Dirac measure. Z is the ratio of voters who vote to Ci from (|6]). 
The distribution has two peaks, one at Z = 1 and the other ai Z = 1 — p. 
Now, we calculate a and /3, where a + (5 = 1. The probability that the path 
touches the absorption wall = m — 1 is given by 

m=0 ^ m=Ofe=0 

= (1 - + ^co{x) + i^rc^ix) + {^rc,{x) + ■■■] 

= (1 - f )[1 + ^Coix) + i^nCoix)}' + i^nCoix)}' + ...] 

p ^ {xCo(x)}fe P. 1 

2 ^ 2^ 2 1 _ £^M^ 

fc=0 -'- 2 

4 - 2p 



3 + Jl-4p(l-p) 



(8) 



Ck{x) is the generating function of the generalized Catalan number f l35|) . and 
X = p(l — p). Here, we use the relations flMl) and flTTI) . 



8 




Figure 3: /3 or s, i.e., average votes for candidate Ci by herders. 



In Fig. [3l we plot (3 for the case q = I. We are interested in the average 
votes for Ci by the herders to vahdate the performance of the herders. We 
define s as the average votes for the correct candidate Ci by the herders, 

. = ^-<'-''>. (9) 

P 

Here, we take expected values of ([9]) about several sequences of voting, 

Z = ps + {l-p) = {l-p)a + P, (10) 

where x means the expected value of x. The second equality can be obtained 
from ([7]). Using the relation a + /3 = 1, we can obtain s = p. 

When p is less than 0.5, herding is a highly efficient strategy. The distri- 
bution of votes peaks when Z = 1. A majority of votes is necessary select 
the correct candidate Ci. At p = pc = 0.5, there is a phase transition. When 
p exceeds pc = 0.5, the distribution of votes has two peaks. In this case, a 
majority may select the wrong candidate Cq. In the language of game theory, 
this is a bad equilibrium. The probability of falling into bad equilibrium is 
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1 — 13. Var{Z), the variance of Z in the large t hmit is the order parameter. 
It is observed that Var{Z) is not differentiable at p = 1/2 (Fig. H]). Hence, 
the phase transition is of the second order. 

When p < pc, the distribution has one peak, and it does not depend on 
Po{m, m), which is the probabihty of the vote when the number of votes for 
Cq is the same as that for Ci. We can confirm that Var{Z) is in Fig. |H 
On the other hand, when p > p^, the hmit distribution depends on Poim, m); 
Po{m,m) is given by ([3]). We can confirm that Var{Z) is not in Fig. HI If 
herders are analog, Var{Z) is in all region of p. 




Figure 4: Var{Z), the variance of Z is the order parameter. Var{Z) is not 
differentiable at pc- Z is the ratio of voters who vote to Ci. 

Next, we consider the general q case. In this case, the path goes across 
the diagonal several times, n = m — 1 is no longer the absorption wall, where 
m,n > 0. Hence, it is difficult to calculate the exact solution for general q. 
However, as in the discussion of ([7]), we can obtain the limit shape of the 
distribution of the votes for Ci, 
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The probability measure of Z is 



/i = a5(i_p)g + /35p+(i_p)g, (12) 
where a + /3 = 1. When g = 1, flT2|) becomes ([7]). We can calculate /3 as 



/3 = /2i(l - i?2)(l + i?ii?2 + + ■■■) = ^'^^^y . (13) 

R\ is the probability that the path starts from (0,0), goes across the 
diagonal only once, and reaches the wall n = m — 1 in // (m > ra). i?i is the 
probability that the path starts from the wall n = m + 1 in / {m < n), goes 
across the diagonal only once, and reaches the wall n = m — 1 in II {m > n). 
i?2 is the probability that the path starts from the wall n = m — 1 in // 
(m > n), goes across the diagonal only once, and reaches the wall n = m + l 
in / {m < n). 

For example, the first term of ( !T3|) is the path that starts from (0, 0) and 
passes through / (m < n) or directly enters // (m > n). The path goes 
across the diagonal III (m = n) only once; the first step is rightwards. The 
second term is the path that starts from (0, 0), goes across the diagonal /// 
(m = n) three times, and enters // {m > n). We can calculate and 
i?2 similarly to ([8]) (See appendix B), 



Ri 



Ri 



2(1-5) 



2-7i(l- Jl-4A(1-A)) 



l-5)7i(l- \/l-4A(l-A)) 



B{2~^,{l-Jl-AAil-A))} 



J, ^ Bl.{l-^l-AC{l-C)) 

(1 - B){2 - 72(1 - ^1-4^(1-^))}' 



where A, B, and C are given by ([I]), 71 = B/A, and 72 = (1 — -B)/(l — C). 
In general, we can calculate the exact value of pc- 

Po=l-^. (15) 

As q increases, Pc increases. At Pc, the model features a phase transition 
beyond which a state where most agents make the correct forecasts coexists 
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with one where most of them are wrong. Thus, the effectiveness of herding 
decreases as q decreases. In the hmit q = 0.5, the phase transition disappears. 
The distribution becomes symmetric in this case. 

From the viewpoint of the herders being noise, if p is greater than pc, the 
vote ratios deviate considerably from the fundamental value q. Thus, digital 
herders account for greater noise than analog herders. Analog herders affect 
only the speed of convergence to the fundamental value. [7] Independent 
voters can not oppose digital herders. 



4 Exact solutions for r = 1 

Here, we discuss the cases r = 1 besides p 7^ Herders can see only a vote 
of the previous voter. We define Pi(t) as the probabihty that the (t + l)th 
voter votes for Cj, where i = 0,1. Here, t denotes the time. 

^'^'^-\ {l-p){l-q)^G ■.Yo{t-l) = 0. j ^^^^ 

Yi(t) = 1 indicates that at t, the voter votes for Cj. Yo{t — 1) = 1 indicates 
that the previous voter votes for Cq. On the other hand, = indicates 
that at t, the voter does not vote for Cj. Yo(^ ~ 1) = indicates that the 
previous voter votes for Ci. Thus, Yd=i ^(0 the total number of votes for 
Ci until t. Here, the relation Pi{t) = l — Po{t) holds. The initial distribution 
is 

Po{0) = ^p+{l~p){l-q). (17) 

The model was studied as a one- dimensional correlated random walk 
[TT][T2]. Here, we introduce X{m,n) as the probability distribution, m is 
the number of the voters who vote for Ci and n is the number of the voters 
who vote for Cq. The master equation is 

X{m,n) = Pi{t-l)X{m-l,n) + Po{t-l)X{m,n-l), (18) 

for m > and n > 0, with the initial condition X(0, 0) = 1. 
In the limit t — )■ 00, 



X{m,t- m) ^ N{qt, 



^(1-^) ,) (19) 
^(l-F)G^' 



^ When p = 1, all voters are herders, and the distribution becomes the limit shape of 
beta distribution, as discussed in [9] 
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where N{fi, cr^) is the normal distribution with mean fi and variance o"^. (See 
Theorem 3.1. in [TT]) F and G are given in fll6p . Hence, we can obtain the 
hmit shape of the distribution as 



m 
T 



Z. (20) 



The probabihty measure of Z is 

/i = S,. (21) 

Then, r = 1 involves no phase transition, and the majority of the voters do 
not select the wrong candidate Cq. 

The limit distribution is independent of the initial condition Po(0) because 
the distribution Z has only one peak. 



5 Mean field approximation 

We discussed the exact solutions of this model in the cases r = 1 and r = oo. 
A phase transition occurs in the case r = oo. On the other hand, there is no 
phase transition in the case r = 1. We must consider the case 1 < r < oo. 
In this section, we analyze phase transition using mean field approximation. 

We define P[{t) as the probability of that the {t + l)th voter votes for Ci, 
where i = 0,1. The voter can see the previous r voters' votes. 

r p+(l-p)(l-q) ■,ElzlrYoil)>r/2; ] 
Po^(t)= p/2 + (l-p)(l-g) ;ElzlrYoil)=r/2; (22) 
[ (l-p)(l-g) ■.j:lZlrYoil)<r/2. J 

YdZt^rYoil) gives the total votes for Cq from {t — r) to (t — 1). In other 
words, it is the total number of votes of the previous r voters for Co at t. 
The case J^jzl^j-YoiO = f'l'^ appears only when r is even. Here, the relation 
Plit) = 1 - P^[t) holds. When r = oo, 022]) reduces to (HD and when r = 1, 
di]) reduces to 

We focus on the probability of the selection of the correct candidate Ci. 
The distribution of Z is the limit shape of the distribution of votes for Ci. 
In general g, we can rewrite the first equality of (jH]) as 

Z = {l-p)q + ps. (23) 
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Here, from the viewpoint of the mean field approximation, s can be consid- 
ered as the sum of the probabihties of every combination of majorities in the 
reference of previous r votes. 

Mean field analysis is an approximation. We can not obtain quantitative 
conclusions from this analysis owing to two major reasons. First, the analysis 
does not use -Po(O) in f l22]) . Second, this approximation assumes independence 
of voters, which is not true. These errors are presented in the next section. 

When r is odd, 

E (l]z\i-zy-^^nr{z). (24) 

r+i \ y / 

y~ 2 

When r is even, from the definition of the behavior of the herder. 




z^i - zy-^ + -( ^.^] z\i - zy'^ 



r 

9 1^ ^' ' 2\r/2 



Z%l-Zy-^-<^ = Qr^i{Z). (25) 



The even case r becomes the odd case r — 1 from the viewpoint of mean field 
analysis. 

Fig. [5] shows the exact solutions of Z, and Fig. [3] shows the exact solu- 
tions of s. Here x is the expected value of x. Both are obtained from the 
conclusions of section 3 for the case r = oo. p is the percentage of herders and 
q is the percentage of correct answers of independent voters. Z increases with 
p up to the critical point f|T5|) . At the critical point, Z is maximum. Above 
the critical point, the distribution becomes the sum of two delta functions, 
and Z decreases as p increases. 

( l23i) and IHM are two self-consistent equations for s or Z. By substituting 
(EID in ([23]), we obtain 

s = nr{Z)p + {1 - p)q (26) 

(1) r = 1 and r = 2 

In this case, Qr{Z) = q- We can obtain s = q from fl24|) . Then, we can 
get s = Z = q. The ratio of herders' votes for Ci is constant, as is that of 
the independent voters' votes. There is no transition in these cases. This is 
consistent with the conclusion of section 4. 
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Figure 5: Average votes ratio for the correct candidate Ci in the case r = oo. 
The veritical axis is E{Z) = Z. At the critical point Pc, Z = E{Z) is 
maximum. 

(2) r = oo 
{i)Z > 1/2 

In this case, Qr{Z) = 1- We can obtain s = 1 and Z = p + {1 — p)q. 
(ii)Z < 1/2 

In this case, Vtr{Z) = 0. We can obtain s = 0, Z = {l~p)q, and the condition 
p > 1 — ^. Then, when r = oo, there is a phase transition at pc = 1 — ^■ 
When p < Pc, the herders always vote for the correct candidate Ci. On the 
other hand, when p > pc, there are two cases. One is the same as that when 
P < Pc- In the other case, the herders always vote for the wrong candidate 
Cq. This phenomenon is known as an information cascade. The conclusion is 
consistent with that of section 3; however, by the mean field approximation, 
we can not obtain the exact distributions that are obtained in section 3. 

(3) 3 < r < oo 

( 126|) admits one solution for p < Pc{r) (see Fig. EJ^a)) and three solutions for 
p > Pcij) (see Fig. El^b)). When p > Pc{r), the upper and lower solutions are 
stable solutions; on the other hand, the intermediate solution is an unstable 

15 



q=0.9 
q=0.8 
q=0.7 
q=0.6 





Figure 6: Solutions of self-consistent equation fl26|) in the case 3 < r < oo 
(a) p < Pc (b)p > Pc- Below the critical point p^ we can obtain one solution 
(a). On the other hand, above the critical point, we obtain three solutions. 
Two of them are stable and one is unstable (b). 

solution. Then, the two stable solutions attain good and bad equilibrium, 
respectively, and, the distribution becomes the sum of the two delta functions, 
as in the case r = oo (see section 3). 

6 Numerical Simulations 

In order to confirm the analytical results, we perform numerical simulations. 
We adopt two approaches, numerical integration of the master equation and 
Monte Carlo simulation for this model. The master equation is given by (11 8 1) 
and Pi{t) is given by ([22]). 



16 




0.2 0.4 0.6 0.8 1 0.2 0.4 0.6 0.8 1 

P P 



Figure 7: Average votes ratio for the correct candidate Ci when q = 0.6. 
(a) numerical integration and (b) numerical integration and Monte Carlo 
simulation. The vertical axis is E{Z) = Z. 

Fig. [7] shows the average votes ratio for the correct candidate Ci. Fig. 
W^a) shows numerical integration at t = 10000. We can see that the even 
case r almost coincides with the odd case r — 1. The conclusion based on the 
previous section is reasonable (See and fl25|) ). Fig. Wljo) shows the nu- 
merical integration and Monte Carlo simulation. The number of simulations 
is 100000. We can check whether the Monte Carlo simulation is consistent 
with the numerical integration. In Fig. Ulh), we can also confirm the exact 
solution for r = oo. The case q = 0.6 in Fig. |5] corresponds to the case 
r = oo in Fig. El^b). We can observe clearly the indifferentiable point at pc 
in Fig. [5l On the other hand, the point in Fig. El^b) is smoother than the 
point in Fig. [51 If we increase the number of Monte Carlo simulations, the 
points will appear similar in Fig. Wljo) and Fig. [51 

Here, we investigate the maximum Z, i.e., the maximum probability of se- 
lecting the correct candidate Ci or maximum percentage of correct answers. 
If the voters can see the previous vote or two, there is no phase transition. 
The percentage of correct answers is constant, as is the percentage of correct 
answers of independent voters q. In this case, the lowest maximum percent- 
age of correct answers is shown (Fig. [Zl^b)). If the voters can see more than 
the previous 2 votes, a phase transition occurs. Above the critical point pc, 
the distribution has two peaks. As r increases, the critical point pc decreases. 

When the voters can see all the previous votes, we believe that the max- 
imum percentage of correct answers is the highest. For example, when we 
select the herding strategy, we collect as much information as we can. How- 
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ever, this is not true. When the voters can see the previous 21 votes, the 
maximum percentage of correct answers is the highest when q = 0.6. Too 
much information induces mistakes among the herders. It can be observed 
in collective behaviors of animal groups such as fish schools and bird flocks 
[13]. There may be limits to the information available to grouping individu- 
als. The average distance maintained between neighbors within pelagic fish 
schools is usually between three-tenths of body length and one body length. 
Individuals can change their position relative to others only on the basis of 
local information. They do not need information of the entire group. 

The maximum probability of the selecting the correct answer is at the 
critical point Pc when r = oo. This can be seen in Fig. El which is the exact 
solution for the case r = oo. On the other hand, for 3 < r < oo, the maximum 
probability of the selecting the correct answer is above the critical point pc- 
In this phase, the distribution of votes has two peaks. Thus, the possibility 
of the majority of voters selecting the wrong answer increases; however, the 
average probability of selecting the correct answer increases. We discussed 
the unstable solution in the 2 peaks phase in the previous section. 

As discussed in the previous section, we find that the conclusions of nu- 
merical integration and Monte Carlo simulation are inconsistent with the 
conclusion of mean field approximation analysis in some respects. For exam- 
ple, when r = 3 and q = 0.6, the critical point is at Pc = 0.78 from mean field 
analysis. At this point, we get Z = 0.89 by using this method. On the other 
hand, from the conclusion of the numerical simulations, we get Pc = 0.74 and 
Z = 0.65 (Fig. EH^b)). The rough estimate of the critical point pc can be 
calculated by mean field approximation; however, it is difficult to estimate 
Z. Mean field approximation is excessively optimistic because it does not 
use the information at t = 1 and J^jzl-r ^o(0 = r/2. When r = 1 and r = 2, 
the distribution Z has one peak; hence, it is independent of these conditions. 
Thus, the conclusions from mean fields approximation are consistent with 
those from numerical simulations. On the other hand, when 3 < r < oo, the 
distribution Z depends on these conditions. 

7 Social experiments 

We conducted simple social experiments for our model. We framed 200 ques- 
tions, each with two choices- knowledge and no knowledge. 31 participants 
answered these questions sequentially. First, they answered the questions 
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without any information about the others' answers, i.e., their answers were 
based on their own knowledge. Those who knew the answers selected the 
correct answers. Those who did not know the answers selected the correct 
answers with a probability of 0.5. Next, the participants were allowed to 
see the previous participants' answers. Those who did not know the answers 
referred to this information. We are interested in whether they referred to 
the information as digital herders or analog herders. 
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Figure 8: Distribution of correct answers, (a) r = and (b) r = oo. We can 
observe one peak at Z = m/t = 0.6 in (a). We can observe two peaks, one at 
Z = m/t = 0.2 and the other at Z = m/t = 1 in (b). The peak a.t Z = 0.2 
is attributed to the wrong answers caused by the information cascade. 



Fig. [8] shows the results of the social experiments, i.e., the distribution of 
the correct answers: (a) r = and (b) r = oo. The average correct answer 
ratio is 0.6 in the case of (a). Hence, the independent voters who knew 
the answers account for 0.2 of all voters, and the herders who did know the 
answers account for 0.8 of all the voters. If the herders are digital, we can 
apply the model described in section 3, with p = 0.8, q = I, and r = oo. 
From ([7]), the distribution of percentage of correct answers has peaks at 
Z = m/t = 0.2 and Z = m/t = 1. Although the number of votes is small 
and does not converge, the prediction can be recognized in Fig. [S] (b). The 
peak at Z = 0.2 is attributed to wrong answers caused by rational choices. 
This phenomenon is known as is an information cascade; it is caused by 
digital herders and not by analog herders. If the herders are analog herders, 
there is one peak aX Z = 1 and there is no information cascade in the case of 
Fig. El^a). We believe that in this case, almost all herders behave as digital 
herders. 
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8 Concluding Remarks 



We investigated a voting model that is similar to a Keynesian beauty contest. 
We calculated the exact solutions for the special cases r = 1 and r = oo, and 
we analyzed the general case using mean field approximation and numerical 
simulations. When r = 1 and r = 2, there is no phase transition. The 
percentage of correct answers is the same as that of independent voters. In 
this case, herders can not increase the percentage of correct answers. When 
r > 3, there is phase transition. As the fraction of herders increases, the 
model features a phase transition beyond which a state where most voters 
make the correct votes coexists with one where most of them are wrong. As 
r increases, the critical point decreases. The phase diagram is shown in Fig. 
m When r = oo, we can obtain the exact solutions. When r < 3 < oo, we 
can not obtain the critical point pc and the distribution precisely. In this 
case, mean field approximation analysis is inadequate. It is a problem that 
must be addressed in the future. 

The high critical point induces a low risk of phase transition. Ants 
use chemical signals called pheromones, and they behave as herders. The 
pheromones evaporate quickly. As an analogy, in our model, r is small. 
Thus, pheromones may amplify the limited intelligence of individual ants 
into something more powerful to avoid phase transition. 

We are also interested in the behavior of human beings. We conducted 
simple experiments for our model when r = and r = oo. Although the 
total number of votes is small and does not converge, in these experiments 
an information cascade is observed. This phenomenon was caused by digital 
herders and not by analog herders. If the herders are analog, the difference 
of the phase is only the velocity of the convergence. Analog herders do not 
lead to erroneous decisions in t = oo. On the other hand, if the herders 
are digital the distribution of the votes has two peaks. One represents good 
equilibrium and the other represents bad equilibrium. In the case of bad 
equilibrium, herders make erroneous decisions at t = oo. We can conclude 
that the information cascade is caused by the phase transition of digital 
herders. Detailed analysis of the experiments is a problem that must be 
addressed in the future. 

Finally, we comment on the relations between our model and the model 
introduced by Curty and Marsili [5J . The mean field equations of their model 
are same as the (^^, and The difference is as follows. (1) The 

number of the agents, A^, of their model is finite. (2) The interactive process 
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Figure 9: Phase diagram in space Pc and r. r is an integer. When r = 1 and 
r = 2, there is no phase transition. When r > 3, there is a phase transition. 
There are two phases. One is a one peak phase, the other is two peaks 
phase. Two peaks phase represents the information cascade. As r increases, 
the critical point decreases. 

is repeated until it converges. We are interested in (2), i.e., the interaction 
among the voters. In the future, we plan to investigate the effects of the 
interaction on the distributions of votes. 
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Appendix A Catalan Number 



Here, we consider the number of monotonic paths along the edges of a grid 
with square cells, which do not pass lower the diagonal. Let m and n be 
the horizontal axis and the vertical axis, respectively. The coordinates of 
the lower left corner are (0, 0). A monotonic path is one which starts in the 
lower left corner, finishes in the upper triangle (m, n), where < m < n, and 
consists entirely of edges pointing rightwards or upwards. 

The number of paths from (0, 0) to (m, n) can be calculated as 

_ {n - m + l){n + m)\ _ ( n + m\ f n + m \ , . 

m!(n + l)! ~ 1^ n )~[n + l)- ^^^^ 

These numbers are known as generalized Catalan number. 

If the finish point is {m,m), the number of paths becomes the Catalan 
number 

C„,„ = o„ = ^?4-=(2"Uf \" ), (28) 

m!(m + l)! \m j y m + 1 J 

Next, we compute the distribution of the number of the paths that start 
in the lower left corner, finish in the upper triangle (m, n), and touch the 
diagonal k times [15]. Let Am,n,k denote the number of paths that touch the 
diagonal k times. We get a simple recursion relation about Am,n,k, 

m—l 

Am,n,k ^ ] Cj-^m—j—l,n—j — l,k—li (^9) 
j=0 

for k > 0, n,m > 0, and m > k, with the initial condition Ao,o,o = 1- This 
defines the numbers Am,n,k uniquely, and it is easy to prove that 

{n — m + k){n + m — k — 1)\ 



n\{m — k)\ 

n + m — k — 1 \ 
n-1 j ~ 




(30) 



From (1271) and ( l30ll . we can obtain the relation: 

Am,m,,k C'ni—k,m—l- (^-^) 
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The well known generating function Cq{x) of Catalan numbers is given 

by 

oo 

71=0 

= l + x + 2x2 + 5x3 + 14x^ + 42a;^ + 132x^ + ---, (32) 
subject to the algebraic relation 

xCoix)' = Co{x) - 1, (33) 

and we can obtain 

Coix) = . (34) 

Here, we obtain the generating function Am,k{x) of Am,m, 



k ■ 



oo 



m—k=0 m—k=0 
oo 

= T.Ci,i+k-ix' = Ck-iix). (35) 
/=o 

We use fl3T]) for the second equality. Cj{x) = Ci^i+jx'' is the generating 
function of the generalized Catalan number fl27|) . The generating function of 
the generalized Catalan number is given by 

oo 

,m+lX 

n=0 

= l + 2x + 5x^ + Ux^ + 42x^ + 132x^ + 429x^ + ■■■, 

oo 

,m+2X 

n=0 

= l + 3x + 9x^ + 28x^ + 90x^ + 297x^ + lOOlx'^ + ■ ■ ■ , 

00 

,m+3X 

n=0 

= 1 + 4x + 14x^ + 48x^ + 165x^ + 572x^ 
From (129|) . we can obtain 

Cj{x) =Cj-i{x)Co{x). (36) 
Thus, the simple relation between the generating functions is given by 

Cjix) = {Co(x)H+\ (37) 
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Appendix B Derivation of and R2 



Figure 10: Ri, and R2. Ri is the probability that the path starts from 
(0,0), goes across the diagonal only once, and reaches the wall n = m — 1 
in II {m > n). Ri is the probability that the path starts from the wall 
n = m + 1'm. I {m < n), goes across the diagonal only once, and reaches the 
wall n = m — 1 in II {m > n). i?2 is the probability that the path starts 
from the wall n = m — 1 in II {m > n), goes across the diagonal only once, 
and reaches the wall n = m + 1 in I {m < n). 

Ri is the probability that the path starts from (0,0), goes across the 
diagonal only once, and reaches the wall n = m — 1 in II {m > n) (Fig. [T(I]) . 



i?i = {i-B)[l + y^^Co{y) + {yiiyC,{y) + {y^)'C2{y) + ---] 

= {1-B)[1 + yjiCoiy) + {yjif{Co{y)V + {yiif{Co{y)}' + ■ ■ ■] 



n=m+1 




n=m-1 



m 



oo 



1-B 
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^ 2(1 -B) (33) 

2-',i(l- ^l-iA(l- A)) 

where A and B are given by ([1]), 71 = B /A, and y = A{1 — A). Ck{y) is the 
generation function of the generahzed Catalan number fl35|) . Here, we use 
the relations ([37]) and ([M]). When g = 1, (|38]) reduces to ([8]). 

Ri is the probability that the path starts from the wall n = m + 1 in / 
{m < n), goes across the diagonal only once, and reaches the wall n = m — 1 
in II {m > n). 

Ri = ^^[yiiCoiy) + {yii)'CM + {yii?C2{y) + ■ ■ ■] 

= ^-^[yiiCoiy) + {yiif{Co{y)r + (y^^nCoiy)}' + ■ ■ ■] 



l-B R, (l-i?)7i(l-Vl-4A(l-A)) 
B B{2-^r{l-Jl-AA{l-A))} 



R2 is the probability that the path starts from the wall n = m — 1 in // 
{m > n), goes across the diagonal only once, and reaches the wall n = m + l 
in / {m < n). 

^2 = Y^[^^2Co(^) + (^72)'C^i(^) + (^72)'C^2(^) + ---] 



5' 



B^2il-Jl-^C{1-C)) 



(1-B){2-^,{1-^1-4C{1-C))} 
where C is given by 72 = (1 - B)/{1 - C), and z = C{1 - C). 
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